Abstract

Let $G$ be a compact connected Lie group with trivial center. Using the action of $G$ on its Lie algebra, we define an operator norm $|\cdot|_G$ which induces a bi-invariant metric $d_G(x,y) = |\mathrm{Ad}(yx^{-1})|_G$ on $G$. We prove the existence of a constant $\beta \approx .12$ (independent of $G$) such that for any closed subgroup $H \subsetneq G$, the diameter of the quotient $G/H$ (in the induced metric) is $\geq \beta$.



1 Introduction

Finding a lower bound to the (operator norm) diameter of homogeneous spaces $G/H$, $G$ compact, is a natural geometric problem. It can also be motivated by considering quantum computation. In standard models [NC] the state space of a (theoretical) quantum computer is a Hilbert space with a tensor decomposition, $(\mathbb{C}^2)^{\otimes n}$. A "gate" is a local unitary operation acting on a small number, perhaps two, tensor factors (and as the identity on the remaining factors). One often wonders if a certain set of local gates is "universal", meaning that the closed subgroup $H$ they generate satisfies $U(1)H = U(2^n)$. We produce a constant $\beta \approx .12$ so that $\mathrm{diam}\, U(2^n)/U(1)H < \beta$ implies universality, where diameter is to be computed in the operator norm. This norm is well-suited here because it is stable under $\otimes\, \mathrm{id}$.

Because the operator norm is bi-invariant it suffices to check that every element $b$ in the ball of radius $2\beta$ about the identity of $SU(2^n)$ has $\mathrm{Ball}_\beta(b) \cap H \neq \emptyset$. In principle this leads to an algorithm to test if a gate set is universal. Such an algorithm will be exponentially slow in $n$. But often it is assumed that identical gates can be applied on any pair of factors; in this case universality for $n = 2$ is sufficient to imply universality for all $n$.

Let $G$ be a compact Lie group with trivial center. The semisimplicity of $G$ implies that the (negative of the) Killing form is a natural positive-definite, bi-invariant inner product on the Lie algebra $\mathfrak{g}$ of $G$. We let $\|x\|_{\mathfrak{g}}$ denote the induced (Euclidean) norm on $\mathfrak{g}$. We use this to define the operator norm on $G$ as follows:

$$|g|_G = \sup_{\|y\|_{\mathfrak{g}} = 1} |\angle(y, \mathrm{Ad}_g y)|$$

where $\angle(y, \mathrm{Ad}_g y)$ denotes the usual Euclidean angle between the vectors $y$ and $\mathrm{Ad}_g y$, normalized so that it lies in the interval $[-\pi, \pi]$. Since angles between vectors in a Euclidean space obey a triangle inequality, we deduce the inequality $|gh|_G \leq |g|_G + |h|_G$. It is also clear that $|g|_G = 0$ if and only if $\mathrm{Ad}_g$ is the identity, which implies that $g$ is the identity since the adjoint action of $G$ is faithful up to the center of $G$, and we have assumed that the center of $G$ is trivial.
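For a concrete feel, consider $G = SO(3)$. There the adjoint representation is the standard action on $\mathbb{R}^3$ and the Killing form is a negative multiple of the dot product, so $|g|_G$ is simply the rotation angle of $g$. The following is a minimal sketch under that assumption; the helper names are illustrative and not part of the text.

```python
import math

def rotation_z(theta):
    """Rotation of R^3 by angle theta (radians) about the z-axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rotation_x(theta):
    """Rotation of R^3 by angle theta (radians) about the x-axis."""
    c, s = math.cos(theta), math.sin(theta)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def matmul(a, b):
    """Product of two 3x3 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(a):
    """Transpose of a 3x3 matrix (the inverse, for a rotation)."""
    return [[a[j][i] for j in range(3)] for i in range(3)]

def operator_norm_so3(r):
    """Rotation angle of r in [0, pi], which equals sup_y angle(y, r y)."""
    trace = r[0][0] + r[1][1] + r[2][2]
    # For a rotation, trace = 1 + 2 cos(theta); clamp against rounding error.
    return math.acos(max(-1.0, min(1.0, (trace - 1.0) / 2.0)))

def distance_so3(g, g_prime):
    """d_G(g, g') = |g^{-1} g'|_G, using g^{-1} = g^T for rotation matrices."""
    return operator_norm_so3(matmul(transpose(g), g_prime))

g, h = rotation_z(0.7), rotation_x(0.4)
print(operator_norm_so3(g), operator_norm_so3(h), operator_norm_so3(matmul(g, h)))
```

The printed values illustrate the triangle inequality $|gh|_G \leq |g|_G + |h|_G$ for one pair of rotations; this is an illustration, not a proof.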

We define a distance on $G$ by the formula $d_G(g,g') = |g^{-1}g'|_G$. It is easy to check that this defines a bi-invariant metric on $G$, where all distances are bounded above by $\pi$. Note that $d_G$ is continuous on $G$, hence there is a continuous bijection from $G$ with its usual topology to $G$ with the topology induced by $d_G$. Since the source is compact and the target Hausdorff (this fails if $G$ has nontrivial center, since the operator norm of a central element is equal to zero), we deduce that the metric determines the usual topology on $G$.

For any closed subgroup $H$ of $G$, the homogeneous space $G/H$ inherits a quotient metric given by the formula

$$d_{G/H}(\bar{p}, \bar{q}) = \inf d_G(p,q) = \inf_{g\bar{p} = \bar{q}} |g|_G$$

where the first infimum is taken over all pairs $p, q \in G$ lifting the pair $\bar{p}, \bar{q} \in G/H$. Note that if $H$ is contained in $H'$, then the diameter of $G/H$ is at least as large as that of $G/H'$.

We are now in a position to state the main result:

Theorem 1. Let $G$ be a compact connected Lie group with trivial center and $H \subsetneq G$ a proper compact subgroup of $G$. Then the diameter of $G/H$ with respect to the metric $d_{G/H}$ is no smaller than $\beta$, where $\beta$ is the smallest positive real solution to the transcendental equation $\cos^2(\alpha - \beta) + \sin^2(\alpha - \beta)\sin(\beta) = \cos(4\beta)$ and $\cos(\alpha) = \frac{7}{8}$.

One can estimate that the constant $\beta$ is approximately .124332.
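The estimate for $\beta$ can be reproduced numerically. The sketch below applies bisection to the defining equation, taking $\alpha$ to be the smallest positive solution of $\cos(\alpha) = \frac{7}{8}$. The bracket $[0, 0.2]$ and the function names are our own assumptions, chosen by hand so that the left side minus the right side changes sign exactly once there.

```python
import math

ALPHA = math.acos(7.0 / 8.0)  # smallest positive solution of cos(alpha) = 7/8

def f(beta):
    """Left side minus right side of the equation defining beta."""
    d = ALPHA - beta
    return math.cos(d) ** 2 + math.sin(d) ** 2 * math.sin(beta) - math.cos(4.0 * beta)

def solve_beta(lo=0.0, hi=0.2, iterations=60):
    """Bisection on [lo, hi]; the bracket is a hand-chosen assumption."""
    assert f(lo) < 0.0 < f(hi)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if f(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

beta = solve_beta()
print(beta)
```

The printed root should agree with the quoted value .124332 up to rounding in the last digit.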

Example 2. Consider the case where $G = H \times H$ is a product, and $H$ is embedded diagonally. Choose an element $h \in H$ with $|h|_H = \frac{\pi}{2}$ (such an element exists in any nontrivial one parameter subgroup). Then in $H \times H$, the distance $d_{H \times H}(h \times e, h' \times h')$ is equal to the larger of $d_H(h,h')$ and $d_H(h',e)$. By the triangle inequality, this distance is at least $\frac{\pi}{4}$. It follows that the diameter of $G/H$ is at least $\frac{\pi}{4}$.
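The bound in Example 2 can be illustrated numerically in a restricted setting. The sketch below assumes $H = SO(3)$ and lets $h'$ range only over rotations about the same axis as $h$, so that each distance is a wrapped difference of angles. It therefore illustrates the bound rather than proving it; the triangle inequality covers arbitrary $h'$. All names are illustrative.

```python
import math

def wrapped_angle(t):
    """Rotation angle in [0, pi] of the rotation by t about a fixed axis."""
    t = math.fmod(t, 2.0 * math.pi)  # now in (-2*pi, 2*pi)
    if t > math.pi:
        t -= 2.0 * math.pi
    elif t < -math.pi:
        t += 2.0 * math.pi
    return abs(t)

def distance_to_diagonal_point(t, h_angle=math.pi / 2):
    """max(d_H(h, h'), d_H(h', e)) for coaxial rotations h, h' by h_angle and t."""
    return max(wrapped_angle(h_angle - t), wrapped_angle(t))

SAMPLES = 100000
best = min(distance_to_diagonal_point(2.0 * math.pi * i / SAMPLES)
           for i in range(SAMPLES))
print(best, math.pi / 4)
```

Within this one-parameter family the minimum is attained at $h'$ equal to the rotation by $\frac{\pi}{4}$, halfway between $e$ and $h$.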

Remarks: 

(1) For any orthogonal representation $\tau : G \to O(V)$ of a group $G$, we can define an operator norm on $G$ with respect to $V$:

$$|g|_{G,\tau} = \sup_{\|v\| = 1} |\angle(v, gv)|$$

This construction has the following properties:

• If $V$ is the complex plane $\mathbb{C}$, and $g \in G$ acts by multiplication by $e^{i\alpha}$ where $-\pi \leq \alpha \leq \pi$, then $|g|_{G,\tau} = |\alpha|$.

• Given any subgroup $H \subseteq G$, the restriction of $|\cdot|_{G,\tau}$ to $H$ is equal to $|\cdot|_{H,\tau|H}$.

• The operator norm associated to a direct sum of representations $\tau_i$ of $G$ is the supremum of the operator norms associated to the representations $\tau_i$.

• In particular, the operator norm on $G$ associated to a representation $V$ is identical with the operator norm on $G$ associated to the complexification $V \otimes_{\mathbb{R}} \mathbb{C}$ (with its induced Hermitian structure).

• To evaluate $|g|_{G,\tau}$, we can replace $G$ by the subgroup generated by $g$ and $V$ by its complexification, which decomposes into one-dimensional complex eigenspaces under the action of $g$. We deduce that $|g|_{G,\tau}$ is the supremum of $|\log \lambda_j|$, where $\{\lambda_j\}$ is the set of eigenvalues for the action of $g$ on $V$ (and the logarithms are chosen to be of absolute value $\leq \pi$).
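The last property gives a practical recipe: the operator norm is the largest absolute argument among the eigenvalues. A minimal sketch follows, assuming the eigenvalues are already known and have modulus one; the function names are ours.

```python
import cmath

def operator_norm_from_eigenvalues(eigenvalues):
    """max_j |log lambda_j|, with arguments taken in [-pi, pi]."""
    return max(abs(cmath.phase(lam)) for lam in eigenvalues)

def rotation_eigenvalues(theta):
    """Eigenvalues of a rotation of R^3 by theta, acting on C^3."""
    return [1.0 + 0.0j, cmath.exp(1j * theta), cmath.exp(-1j * theta)]

# A direct sum of two rotations: its eigenvalues are the union of both lists,
# so the norm is the larger of the two rotation angles.
direct_sum = rotation_eigenvalues(0.3) + rotation_eigenvalues(1.1)
print(operator_norm_from_eigenvalues(rotation_eigenvalues(0.3)))
print(operator_norm_from_eigenvalues(direct_sum))
```

The second value illustrates the direct-sum property: the norm of the sum is the supremum of the norms of the summands.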

(2) The reader may be curious about the diameter of $G/H$ relative to the Riemannian quotient of the Killing metric $d_K$. If we let $N$ denote the dimension of $\mathfrak{g}$, then we have

?,md 

d<dK< 

(3) We ask if the quotient $SO(3)/I$ is the homogeneous space of smallest diameter, where $I \cong A_5$ denotes the symmetry group of the icosahedron.

(4) We wonder if there is a similar universal lower bound to the diameter of double coset spaces $K \backslash G / H$, $G$ as above, $K, H \subseteq G$ closed subgroups. Our method does not apply directly.

(5) Although suggested by a modern subject, the theorem could easily have been proved a hundred years ago and in fact may have been (or may be) known.

2 Small Subgroups 

Throughout this section, $G$ shall denote a compact, connected Lie group with trivial center. We give a quantitative version of the principle that discrete subgroups of $G$ generated by "sufficiently small" elements are automatically abelian. We will use this in the proof of Theorem 1 in the case where $H$ is discrete.

We will need to understand the operator norm on G a bit better. 
To this end, we introduce the operator norm 

$$|x|_{\mathfrak{g}} = \sup_{\|y\|_{\mathfrak{g}} = 1} \|[x,y]\|_{\mathfrak{g}}$$

on the Lie algebra $\mathfrak{g}$ of $G$. This is a $G$-invariant function on $\mathfrak{g}$, so we can unambiguously define the operator norm of any tangent vector to the manifold $G$ by transporting that tangent vector to the origin (via left or right translation) and then applying $x \mapsto |x|_{\mathfrak{g}}$.

The operator norm on $\mathfrak{g}$ is related to the operator norm on $G$ by the following:

Lemma 3. The exponential map $x \mapsto \exp(x)$ induces a bijection between $\mathfrak{g}_0 = \{x \in \mathfrak{g} : |x|_{\mathfrak{g}} < \frac{2\pi}{3}\}$ and $G_0 = \{g \in G : |g|_G < \frac{2\pi}{3}\}$. This bijection preserves the operator norms.

Proof. First, we claim that the map $x \mapsto \exp(x)$ does not increase the operator norm. This follows from the fact that the eigenvalues of $\exp(x)$ have the form $\exp(\kappa)$, where $\kappa$ is an eigenvalue of $x$. It follows that the exponential map sends $\mathfrak{g}_0$ into $G_0$.

Choose $g \in G_0$, and fix a maximal torus $T$ containing $g$. Let $\mathfrak{t}$ be the Lie algebra of $T$. Decompose $\mathfrak{g} \otimes_{\mathbb{R}} \mathbb{C}$ into eigenspaces for the action of $T$: $\mathfrak{g} \otimes_{\mathbb{R}} \mathbb{C} = \mathfrak{t} \otimes_{\mathbb{R}} \mathbb{C} \oplus \bigoplus_{\alpha} \mathfrak{g}_\alpha$. The element $g$ acts by an eigenvalue $\Lambda(\alpha)$ on each nonzero eigenspace $\mathfrak{g}_\alpha$. Since $g$ is an orthogonal transformation, we may write $\Lambda(\alpha) = e^{i\lambda(\alpha)}$. Since $g \in G_0$, it is possible to choose the function $\lambda$ so that $-\frac{2\pi}{3} < \lambda(\alpha) < \frac{2\pi}{3}$ for each root $\alpha$. This determines the function $\lambda$ uniquely.

Choose a system $\Delta$ of simple roots, and let $x$ be the unique element of $\mathfrak{t}$ such that $\alpha(x) = \lambda(\alpha)$ for each $\alpha \in \Delta$. It follows immediately that $\exp(x) = g$ (since $G$ has trivial center). To show that $x \in \mathfrak{g}_0$, we need to show that $|\alpha(x)| < \frac{2\pi}{3}$ for all roots $\alpha$. For this, it will suffice to prove that $\alpha(x) = \lambda(\alpha)$ for all roots $\alpha$.

The uniqueness of $\lambda$ implies immediately that $\lambda(-\alpha) = -\lambda(\alpha)$. Thus, it will suffice to prove that the equation $\alpha(x) = \lambda(\alpha)$ holds when $\alpha$ is positive (with respect to the root basis $\Delta$). Since the equation is known to hold whenever $\alpha \in \Delta$, it will suffice to prove that $\alpha(x) = \lambda(\alpha)$, $\beta(x) = \lambda(\beta)$ implies

$$(\alpha + \beta)(x) = \lambda(\alpha + \beta).$$

In other words, we need to show that the quantity

$$\epsilon = \lambda(\alpha + \beta) - \lambda(\alpha) - \lambda(\beta)$$

is equal to zero. By construction, $|\epsilon| < 2\pi$. On the other hand, since $\Lambda(\alpha)\Lambda(\beta) = \Lambda(\alpha + \beta)$, we deduce that $e^{i\epsilon} = 1$, so that $\epsilon$ is an integral multiple of $2\pi$. It follows that $\epsilon = 0$, as desired.

It is clear from the construction that $|x|_{\mathfrak{g}} = |g|_G$. To complete the proof, we need to show that $g$ has no other logarithms lying in $\mathfrak{g}_0$. This follows from the fact that any unitary transformation (in particular, the adjoint action of $g$ on $\mathfrak{g}$) which does not have $-1$ as an eigenvalue has a unique logarithm whose eigenvalues are of absolute value $< \pi$. □

Lemma 4. Let $p : [0,1] \to G$ be a smooth function with $p(0)$ equal to the identity of $G$. Then $|p(1)|_G \leq \int_0^1 |p'(t)|_{\mathfrak{g}}\, dt$.

Proof. For $N$ sufficiently large, we can write $p(\frac{i+1}{N}) = p(\frac{i}{N})\exp(\frac{x_i}{N})$, where $x_i$ is approximately equal to the derivative of $p$ at $\frac{i}{N}$. Thus, as $N$ goes to $\infty$, the average $\frac{|x_0|_{\mathfrak{g}} + \cdots + |x_{N-1}|_{\mathfrak{g}}}{N}$ converges to the integral on the right hand side of the desired inequality. By the triangle inequality, it will suffice to prove that $|p(\frac{i}{N})^{-1} p(\frac{i+1}{N})|_G \leq |\frac{x_i}{N}|_{\mathfrak{g}}$. If $N$ is sufficiently large, then this follows immediately from Lemma 3. □

Remark 5. The metric $d_G$ on $G$ is not necessarily a path metric: given $g, h \in G$, there does not necessarily exist a path in $G$ having length equal to $d_G(g,h)$. However, it follows from Lemma 3 that $d_G$ is a path metric locally on $G$. The length of a (smooth) path can be obtained by integrating the operator norm of the derivative of a path. Replacing $d_G$ by the associated path metric only increases distances, so that Theorem 1 remains valid for the path metric associated to $d_G$. This modified version of Theorem 1 makes sense (and remains true) for compact Lie groups $G$ with finite center.

We can now proceed to the main result of this section. Let $\alpha$ denote the smallest positive real number satisfying $\cos(\alpha) = \frac{7}{8}$.

Theorem 6. Let $H \subseteq G$ be a discrete subgroup. Let $h, k \in H$ and suppose $|h|_G < \frac{\pi}{2}$, $|k|_G < \alpha$. Then $[h,k] = 1$.

Proof. We define a sequence of elements of $G$ by recursion as follows: $h_0 = h$, $h_{n+1} = [h_n, k]$. Let $C$ satisfy the equation $\frac{C^2}{4} = 2 - 2\cos|k|_G$. Then the assumption on $k$ ensures that $C < 1$. Our first goal is to prove that the operator norm of the sequence $\{h_n\}$ obeys the estimate $|h_n|_G < C^n \frac{\pi}{2}$. For $n = 0$, this is part of our hypothesis. Assuming that the estimate $|h_n|_G < C^n \frac{\pi}{2}$ is valid, we can use Lemma 3 to write $h_n = \exp(x)$, $|x|_{\mathfrak{g}} < C^n \frac{\pi}{2}$. Now define $p(t) = [\exp(tx), k]$, so that $p(0) = 1$ and $p(1) = h_{n+1}$.

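Using Lemma 4, we deduce that $|h_{n+1}|_G \leq \int_0^1 |p'(t)|_{\mathfrak{g}}\, dt \leq \sup_t |p'(t)|_{\mathfrak{g}}$. On the other hand, the vector $p'(t)$ can be written as a difference

$$p'(t) = L_{\exp(tx)} R_{k\exp(-tx)k^{-1}}\, x - L_{\exp(tx)k} R_{\exp(-tx)k^{-1}}\, x,$$

where $L_g$ and $R_g$ denote left and right translation by $g$. We obtain

$$\begin{aligned}
|p'(t)|_{\mathfrak{g}} &= |x - \mathrm{Ad}_{\exp(tx)k\exp(-tx)}\, x|_{\mathfrak{g}} \\
&= |x - \mathrm{Ad}_k x|_{\mathfrak{g}} \\
&= \sup_{\|y\|_{\mathfrak{g}} = 1} \|[x - \mathrm{Ad}_k x, y]\|_{\mathfrak{g}} \\
&\leq \sup_{\|y\|_{\mathfrak{g}} = 1} \left( \|[x,y] - \mathrm{Ad}_k[x,y]\|_{\mathfrak{g}} + \|\mathrm{Ad}_k[x,y] - [\mathrm{Ad}_k x, y]\|_{\mathfrak{g}} \right) \\
&\leq \sup_{\|y\|_{\mathfrak{g}} = 1} \|[x,y] - \mathrm{Ad}_k[x,y]\|_{\mathfrak{g}} + \sup_{\|y\|_{\mathfrak{g}} = 1} \|[x, y - \mathrm{Ad}_k^{-1} y]\|_{\mathfrak{g}} \\
&\leq \sqrt{2 - 2\cos|k|_G}\, \sup_{\|y\|_{\mathfrak{g}} = 1} \|[x,y]\|_{\mathfrak{g}} + |x|_{\mathfrak{g}} \sup_{\|y\|_{\mathfrak{g}} = 1} \|y - \mathrm{Ad}_k^{-1} y\|_{\mathfrak{g}} \\
&\leq 2\sqrt{2 - 2\cos|k|_G}\; |x|_{\mathfrak{g}} \\
&= C|x|_{\mathfrak{g}}
\end{aligned}$$

as desired.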

It follows that the operator norms of the sequence $\{h_n\}$ converge to zero. Therefore the sequence $\{h_n\}$ converges to the identity of $G$. Since $H$ is a discrete subgroup, it follows that $h_n$ is equal to the identity if $n$ is sufficiently large. We will next show that $h_n = 1$ for all $n > 0$, using an argument of Frobenius which proceeds by a descending induction on $n$. Once we know that $h_1 = 1$, the proof will be complete.

Assume that $h_{n+1} = 1$. Then $k$ commutes with $h_n$, and therefore also with $h_n k = h_{n-1} k h_{n-1}^{-1}$. It follows that $\mathfrak{g} \otimes_{\mathbb{R}} \mathbb{C}$ admits a basis whose elements are eigenvectors for both $k$ and $h_{n-1} k h_{n-1}^{-1}$. If the eigenvalues are the same in both cases, then we deduce that $k = h_{n-1} k h_{n-1}^{-1}$, so that $h_n$ is the identity and we are done. Otherwise, there exists $v \in \mathfrak{g} \otimes_{\mathbb{R}} \mathbb{C}$ which is an eigenvector for both $k$ and $h_{n-1} k h_{n-1}^{-1}$, with different eigenvalues. Equivalently, both $v$ and $h_{n-1}^{-1} v$ are eigenvectors for $k$, with different eigenvalues. Thus $v$ and $h_{n-1}^{-1} v$ are orthogonal, which implies $|h_{n-1}|_G \geq \frac{\pi}{2}$, a contradiction. □
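The role of the threshold $\cos(\alpha) = \frac{7}{8}$ in the proof of Theorem 6 can be checked directly: the contraction factor is $C = 2\sqrt{2 - 2\cos|k|_G} = 4\sin(|k|_G/2)$, which is less than $1$ exactly when $\cos|k|_G > \frac{7}{8}$. The sketch below tabulates the resulting upper bounds $C^n \frac{\pi}{2}$ on $|h_n|_G$ for a sample value of $|k|_G$; the function names and the sample value $0.4$ are our own choices.

```python
import math

ALPHA = math.acos(7.0 / 8.0)  # smallest positive solution of cos(alpha) = 7/8

def contraction_constant(k_norm):
    """C = 2 * sqrt(2 - 2 cos|k|_G), the factor from the iterated commutator estimate."""
    return 2.0 * math.sqrt(2.0 - 2.0 * math.cos(k_norm))

def commutator_norm_bounds(k_norm, steps):
    """Upper bounds C^n * pi/2 on |h_n|_G for n = 0, ..., steps - 1."""
    c = contraction_constant(k_norm)
    return [c ** n * math.pi / 2 for n in range(steps)]

print(contraction_constant(ALPHA))      # the borderline case, where C reaches 1
print(commutator_norm_bounds(0.4, 5))   # geometric decay for |k|_G = 0.4 < alpha
```

For any $|k|_G < \alpha$ the bounds decay geometrically, which is what forces the iterated commutators in a discrete subgroup to become trivial.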



3 The Proof when H is Discrete 

In this section, we will give the proof of Theorem 1 in the case where $H$ is a discrete subgroup. The idea is to show that if $G/H$ is too small, then $H$ contains noncommuting elements which are close to the identity, contradicting Theorem 6.

In the statements that follow, we let $\alpha$ denote the smallest positive real solution to $\cos(\alpha) = \frac{7}{8}$ and $\beta$ the smallest positive real solution to the transcendental equation $\cos^2(\alpha - \beta) + \sin^2(\alpha - \beta)\cos(\frac{\pi}{2} - \beta) = \cos(4\beta)$.

Lemma 7. Let $G$ be a compact, connected Lie group with trivial center. Then there exist elements $h, k \in G$ having the property that for any $h', k' \in G$ with $d_G(h,h'), d_G(k,k') < \beta$, we have $|h'|_G < \frac{\pi}{2}$, $|k'|_G < \alpha$, and $[h',k'] \neq 1$.

Proof. Choose a (local) embedding $p : SU(2) \to G$ corresponding to a root of some simple component of $G$. We will assume that if the relevant component has roots of two different lengths, then the embedding $p$ corresponds to a long root. This ensures that the weights of $SU(2)$ acting on $\mathfrak{g}$ are no larger than the weights of the adjoint representation.

In the Lie algebra $\mathfrak{so}(3)$ of $SU(2)$, we let $x$ and $y$ denote infinitesimal rotations of angles $\frac{\pi}{2} - \beta$ and $\alpha - \beta$ about orthogonal axes. Then, by the above condition on weights, we deduce that $h = p(\exp(x))$ and $k = p(\exp(y))$ satisfy the conditions $|h|_G = \frac{\pi}{2} - \beta$, $|k|_G = \alpha - \beta$.

We claim that the pair $h, k \in G$ satisfies the conclusion of the lemma. To see this, choose any pair $h', k' \in G$ with $d(h,h'), d(k,k') < \beta$. Then we deduce $|h'|_G < \frac{\pi}{2}$, $|k'|_G < \alpha$ from the triangle inequality. To complete the proof, we must show that $h'$ and $k'$ do not commute. To see this, we let $v$ denote the image in $\mathfrak{g}$ of a vector in $\mathfrak{so}(3)$ about which $x$ is an infinitesimal rotation. Then $hv = v$, while $\angle(v, kv) = \alpha - \beta$. Elementary trigonometry now yields

$$\begin{aligned}
\angle(hkv, khv) &= \angle(hkv, kv) \\
&= \cos^{-1}\left(\cos^2(\alpha - \beta) + \sin^2(\alpha - \beta)\cos(\tfrac{\pi}{2} - \beta)\right) \\
&= \cos^{-1}(\cos(4\beta)) = 4\beta.
\end{aligned}$$

By the triangle inequality, we get

$$\begin{aligned}
4\beta = \angle(hkv, khv) &\leq \angle(hkv, h'kv) + \angle(h'kv, h'k'v) + \angle(h'k'v, k'h'v) \\
&\quad + \angle(k'h'v, k'hv) + \angle(k'hv, khv) \\
&< 4\beta + \angle(h'k'v, k'h'v),
\end{aligned}$$

which implies $\angle(h'k'v, k'h'v) > 0$, so that $[h',k'] \neq 1$. □
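The "elementary trigonometry" step can be checked numerically in the model case $SO(3)$ acting on $\mathbb{R}^3$, which stands in for the image of $\mathfrak{so}(3)$ in $\mathfrak{g}$. The sketch below takes $h$ to be a rotation by $\frac{\pi}{2} - \beta$ about the $z$-axis (so $v = e_z$ is fixed) and $k$ a rotation by $\alpha - \beta$ about the $x$-axis, then compares $\angle(hkv, khv)$ with the closed form. The choice of axes and all names are assumptions made for illustration.

```python
import math

ALPHA = math.acos(7.0 / 8.0)

def rot_z(t):
    c, s = math.cos(t), math.sin(t)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def rot_x(t):
    c, s = math.cos(t), math.sin(t)
    return [[1.0, 0.0, 0.0], [0.0, c, -s], [0.0, s, c]]

def apply(m, v):
    """Matrix-vector product for a 3x3 matrix and a 3-vector."""
    return [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]

def angle(u, w):
    """Unsigned Euclidean angle between two nonzero vectors."""
    dot = sum(a * b for a, b in zip(u, w))
    norms = math.sqrt(sum(a * a for a in u) * sum(b * b for b in w))
    return math.acos(max(-1.0, min(1.0, dot / norms)))

def angle_hkv_khv(beta):
    h = rot_z(math.pi / 2 - beta)  # fixes v = e_z
    k = rot_x(ALPHA - beta)        # rotation about an orthogonal axis
    v = [0.0, 0.0, 1.0]
    return angle(apply(h, apply(k, v)), apply(k, apply(h, v)))

def closed_form(beta):
    d = ALPHA - beta
    return math.acos(math.cos(d) ** 2
                     + math.sin(d) ** 2 * math.cos(math.pi / 2 - beta))

beta = 0.124332  # the approximate value quoted for the constant
print(angle_hkv_khv(beta), closed_form(beta), 4 * beta)
```

At the quoted value of $\beta$ the three printed numbers should agree to within the precision of that approximation.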

We can now complete the proof of Theorem 1 in the case where $H$ is discrete:

Proof. Choose $h, k \in G$ satisfying the conclusion of Lemma 7. Since $G/H$ has diameter less than $\beta$, the cosets $hH$ and $kH$ are within $\beta$ of the identity coset in $G/H$, which implies that there exist $h', k' \in H$ with $d(h,h'), d(k,k') < \beta$. Lemma 7 ensures that $h'$ and $k'$ do not commute, which contradicts Theorem 6. □

4 The Proof when G is Simple 

In this section, we give the proof of the main theorem in the case where $H$ is nondiscrete and $G$ is simple. The idea in this case is to show that because the Lie algebra $\mathfrak{h}$ of $H$ cannot be a $G$-invariant subspace of $\mathfrak{g}$, the action of $G$ automatically moves it quite a bit: this is made precise by Theorem 10. Since $\mathfrak{h}$ is invariant under the action of $H$, this will force $G/H$ to have large diameter in the operator norm.

We begin with some general remarks about angles between subspaces of a Hilbert space. Let $V$ be a real Hilbert space, and let $U, W \subseteq V$ be linear subspaces. The angle $\angle(U,W)$ between $U$ and $W$ is defined to be

$$\max\left( \sup_{u \in U - \{0\}} \inf_{w \in W - \{0\}} |\angle(u,w)|,\ \sup_{w \in W - \{0\}} \inf_{u \in U - \{0\}} |\angle(u,w)| \right).$$

Note that for a fixed unit vector $u \in U$, the cosine of the minimal angle $\angle(u,w)$ with $w \in W$ is equal to the length of the orthogonal projection of $u$ onto $W$. Thus, the sine of the minimal (positive) angle is equal to the length of the orthogonal projection of $u$ onto $W^\perp$. Consequently we have

$$\sin\left( \sup_{u \in U - \{0\}} \inf_{w \in W - \{0\}} |\angle(u,w)| \right) = \sup_{u \in U,\ w^\perp \in W^\perp,\ \|u\| = 1,\ \|w^\perp\| = 1} \langle u, w^\perp \rangle$$

which is symmetric in $U$ and $W^\perp$. From this symmetry we can deduce:

Lemma 8. For any pair of subspaces $U, W \subseteq V$, the angle $\angle(U,W)$ is equal to the angle $\angle(U^\perp, W^\perp)$.

We will also need the following elementary fact: 

Lemma 9. Let $V$ be a finite-dimensional Hilbert space, and let $A$ be an endomorphism of $V$ having rank $k$. Then $|\mathrm{Tr}(A)| \leq k|A|$.

Proof. Choose an orthonormal basis $\{v_i\}_{1 \leq i \leq n}$ for $V$ having the property that $Av_i = 0$ for $i > k$. Then

$$|\mathrm{Tr}(A)| = \Big| \sum_i \langle v_i, Av_i \rangle \Big| \leq \sum_{1 \leq i \leq k} |\langle v_i, Av_i \rangle| \leq \sum_{1 \leq i \leq k} |A| = k|A|.$$

□

We now proceed to the main point. 

Theorem 10. Let $G$ be a compact Lie group acting irreducibly on a (necessarily finite dimensional) complex Hilbert space $V$. Let $W \neq 0, V$ be a nontrivial subspace. Then there exists $g \in G$ such that $\angle(W, gW) \geq \frac{\pi}{4}$.

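Proof. Suppose, to the contrary, that $\angle(W, gW) < \frac{\pi}{4}$ for all $g \in G$. Let $V$ have dimension $n$. Replacing $W$ by $W^\perp$ if necessary, we may assume that the dimension $k$ of $W$ satisfies $k \leq \frac{n}{2}$. For any subspace $U \subseteq V$, we let $\Pi_U$ denote the orthogonal projection onto $U$.

For each $g \in G$, projection from $gW$ onto $W^\perp$ or from $W^\perp$ to $gW$ shrinks lengths by a factor of $\sin\angle(W, gW) < \sin\frac{\pi}{4}$ at least. It follows that

$$|\Pi_{gW}\Pi_{W^\perp}\Pi_{gW}| < \frac{1}{2}.$$

Using the identity $\mathrm{Tr}(AB) = \mathrm{Tr}(BA)$ together with Lemma 9, we deduce

$$\mathrm{Tr}(\Pi_{gW}\Pi_{W^\perp}) = \mathrm{Tr}(\Pi_{gW}\Pi_{W^\perp}\Pi_{gW}) < \frac{k}{2}.$$

Integrating this result over $G$ (with respect to a Haar measure which is normalized so that $\int_G 1 = 1$), we deduce

$$\mathrm{Tr}\Big( \Big( \int_G \Pi_{gW} \Big) \Pi_{W^\perp} \Big) = \int_G \mathrm{Tr}(\Pi_{gW}\Pi_{W^\perp}) < \frac{k}{2}.$$

On the other hand, $\int_G \Pi_{gW}$ is a $G$-invariant element of $\mathrm{End}(V)$. Since $V$ is irreducible, Schur's lemma implies that $\int_G \Pi_{gW} = \lambda 1_V$ for some scalar $\lambda \in \mathbb{C}$. We can compute $\lambda$ by taking traces:

$$n\lambda = \mathrm{Tr}(\lambda 1_V) = \mathrm{Tr}\Big( \int_G \Pi_{gW} \Big) = \int_G \mathrm{Tr}(\Pi_{gW}) = k$$

so that $\lambda = \frac{k}{n}$. Thus $\frac{k(n-k)}{n} = \mathrm{Tr}(\frac{k}{n}\Pi_{W^\perp}) < \frac{k}{2}$, so that $2(n - k) < n$, a contradiction. □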
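The averaging step can be illustrated with a finite group in place of $G$. The sketch below assumes the dihedral group of order 6 acting on the plane by rotations through multiples of $\frac{2\pi}{3}$ and a reflection (a representation whose complexification is irreducible), with $W$ the line spanned by the first basis vector, so $k = 1$ and $n = 2$. It checks that the average of the projections $\Pi_{gW}$ is $\frac{k}{n} 1_V$ and that some $g$ moves $W$ by at least $\frac{\pi}{4}$. This example and its names are ours, not the text's.

```python
import math

def rot(t):
    c, s = math.cos(t), math.sin(t)
    return [[c, -s], [s, c]]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(a):
    return [[a[j][i] for j in range(2)] for i in range(2)]

IDENTITY = [[1.0, 0.0], [0.0, 1.0]]
REFLECTION = [[1.0, 0.0], [0.0, -1.0]]
# The six elements of the dihedral group: three rotations, each with or without a flip.
GROUP = [matmul(rot(2.0 * math.pi * j / 3), m)
         for j in range(3) for m in (IDENTITY, REFLECTION)]

P_W = [[1.0, 0.0], [0.0, 0.0]]  # orthogonal projection onto W = span(e_1)

def projection_onto_gW(g):
    """Projection onto gW, computed as g P_W g^T for orthogonal g."""
    return matmul(matmul(g, P_W), transpose(g))

def average_projection():
    total = [[0.0, 0.0], [0.0, 0.0]]
    for g in GROUP:
        p = projection_onto_gW(g)
        for i in range(2):
            for j in range(2):
                total[i][j] += p[i][j] / len(GROUP)
    return total

def line_angle(g):
    """Angle in [0, pi/2] between the lines W and gW, from |<e_1, g e_1>|."""
    return math.acos(min(1.0, abs(g[0][0])))

average = average_projection()
max_angle = max(line_angle(g) for g in GROUP)
print(average, max_angle)
```

Here the three lines $gW$ sit at mutual angles of $\frac{\pi}{3}$, so the largest displacement exceeds the bound $\frac{\pi}{4}$ guaranteed by the theorem.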



From Theorem 10, one can easily deduce the analogous result in the case when $V$ is a real Hilbert space, provided that $V \otimes_{\mathbb{R}} \mathbb{C}$ remains an irreducible representation of $G$. Using this, we can easily complete the proof of Theorem 1 in the case where $G$ is simple and $H$ is nondiscrete (with an even better constant).

Proof. Let $\mathfrak{h}$ denote the Lie algebra of $H$. Since $H \neq G$ and $G$ is connected, $\mathfrak{h} \neq \mathfrak{g}$. Since $H$ is nondiscrete, $\mathfrak{h} \neq 0$. Since $\mathfrak{g} \otimes_{\mathbb{R}} \mathbb{C}$ is an irreducible representation of $G$, we deduce that there exists $g \in G$ such that $\angle(g\mathfrak{h}, \mathfrak{h}) \geq \frac{\pi}{4}$. Now one deduces that for any $h \in H$, $gh' \in gH$, the distance

$$d(gh', h) = |gh'h^{-1}|_G \geq \angle(gh'h^{-1}\mathfrak{h}, \mathfrak{h}) = \angle(g\mathfrak{h}, \mathfrak{h}) \geq \frac{\pi}{4}.$$

It follows that the distance between the cosets $gH$ and $H$ in $G/H$ is at least $\frac{\pi}{4}$. □



5 The General Case 

We now know that Theorem 1 is valid under the additional assumption that the group $G$ is simple. We will complete the proof by showing how to reduce to this case. The main tool is the following observation:

Proposition 11. Let $\pi : G \to G'$ be a surjection of compact connected Lie groups with trivial center, let $H$ be a closed subgroup of $G$ and $H' = \pi(H)$ its image in $G'$. Then $\mathrm{diam}(G'/H') \leq \mathrm{diam}(G/H)$.

Proof. For any points $x', y' \in G'/H'$, we can lift them to a pair of points $x, y \in G/H$. It will suffice to show $d_{G/H}(x,y) \geq d_{G'/H'}(x',y')$. The left hand side is equal to

$$\inf_{gx = y} |g|_G$$

and the right hand side to

$$\inf_{g'x' = y'} |g'|_{G'}.$$

To complete the proof, it suffices to show that $|g|_G \geq |\pi(g)|_{G'}$. This follows immediately since we may identify the Lie algebra $\mathfrak{g}'$ of $G'$ with a direct summand of $\mathfrak{g}$. □

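Now assume that $G$ is a compact, connected Lie group with trivial center. Then it is a product of simple factors $\{G_\alpha\}_{\alpha \in A}$. Let $\pi_\alpha : G \to G_\alpha$ denote the projection. Let $H \subseteq G$ be a closed subgroup. If $\pi_\alpha H \neq G_\alpha$ for some $\alpha \in A$, then $\mathrm{diam}(G/H) \geq \mathrm{diam}(G_\alpha/\pi_\alpha H) \geq \beta$ and we are done. Otherwise, $\pi_\alpha$ induces a surjection of Lie algebras $\mathfrak{h} \to \mathfrak{g}_\alpha$ for each $\alpha$. By the structure theory of reductive Lie algebras, we deduce that $\mathfrak{h} = \mathfrak{h}_\alpha \oplus \mathfrak{h}'_\alpha$, where $\pi_\alpha$ is zero on $\mathfrak{h}'_\alpha$ and induces an isomorphism $\mathfrak{h}_\alpha \to \mathfrak{g}_\alpha$. Since $\mathfrak{h}_\alpha$ is therefore simple, $\mathfrak{h}'_\alpha$ may be characterized as the centralizer of $\mathfrak{h}_\alpha$ in $\mathfrak{h}$.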

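Since $H \neq G$ and $G$ is connected, $H$ must have smaller dimension than $G$. It follows that the subalgebras $\mathfrak{h}_\alpha \subseteq \mathfrak{h}$ cannot all be distinct. Choose distinct $\alpha, \alpha' \in A$ with $\mathfrak{h}_\alpha = \mathfrak{h}_{\alpha'}$. Then the map $H \to G_\alpha \times G_{\alpha'}$ is not surjective on Lie algebras. Without loss of generality, we may replace $G$ by $G_\alpha \times G_{\alpha'}$ and $H$ by its image in $G_\alpha \times G_{\alpha'}$.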

Since the Lie algebra of $H$ now maps isomorphically onto the Lie algebras of the factors $G_\alpha$ and $G_{\alpha'}$, it follows that the connected component $H_0$ of the identity in $H$ is isomorphic to $G_\alpha$, which is included diagonally in $G_\alpha \times G_{\alpha'}$. Then $H = H_0(H \cap (G_\alpha \times 1))$. The intersection $K = H \cap (G_\alpha \times 1)$ is normalized by $H_0 = \{(g,g) : g \in G_\alpha\}$, hence it is normalized by $G_\alpha \times \{e\}$. Since $G_{\alpha'}$ is simple, we deduce that $K = \{e\}$. Thus $H = H_0$ is embedded diagonally in $G_\alpha \times G_{\alpha'}$. We have already considered this case in Example 2, where we saw that the diameter of $G/H$ is at least $\frac{\pi}{4}$.

Remark 12. If we restrict our attention to the case where $H$ is a connected subgroup of $G$, then our proof gives a better lower bound of $\frac{\pi}{4}$.